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THE USE OF SHRINKAGE TECHNIQUES IN 
THE ESTIMATION OF ATTRITION RATES 
FOR LARGE SCALE MANPOWER MODELS 



R. R. Read 

Department of Operations Research 
Naval Postgraduate School 
Monterey, CA 93943-5000 



Abstract 



This report summarizes the research to date performed by the author 
and his students on the use of modern multiparameter estimation tech- 
niques in the building of an attrition rate generator in support of the USMC 
Officer Planning and Utility System (OPUS). Three main areas are identi- 
fied: The cell aggregation problem; the specifics of parameter estimation; 
the need to match forecasting techniques to the specific application. Most 
of the effort has been in the first two of these areas and much has been 
learned. The aggregation problem, i.e., the grouping of personnel cells into 
an appropriate number having common size and attrition behavior, has 
emerged as the most important problem that requires immediate atten- 
tion. Its resolution is expected to lead to a clear policy for multiparameter 
estimation. Estimation and forecasting are both impacted by the nature of 
the data base. It is likely that specific applications will use differing data 
bases and differing statistical techniques as well. 
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EXECUTIVE SUMMARY 



The author and his students have been working with a number of modern 
techniques applied to the problem of estimating attrition (leave the service) rates 
for the numerous cells that appear in manpower planning models for the USMC 
officer corps. Special attenion is given to the “small cell” problem; that is, officer 
categories that normally contain but a few personnel. These cells are numer- 
ous and historical empirical rates for them are generally quite unstable. This 
report will summarize what we have learned to date, and outline the research 
continuation plans. 

Work that has applied shrinkage type estimators to the problem of estimating 
officer attrition rates has been reported in [22,50,57]. The methods tested have 
been successful in the comparative sense. That is, they perform better than the 
raw historical rates that might be used in an ad hoc fashion. But their behavior 
in the absolute sense still has erratic aspects. Moveover we do not have a solid 
way to anticipate the areas of unstable performance. 

The recent acquisition of a much more refined date tape and the theses by 
Larsen and Dickenson [38,22] have lead to greater insight to this problem. In 
particular the new data covers ten years, breaks out officer grade by above zone 
and in or below zone, regular and reserve, unrestricted and limited duty, etc. 
The thesis by Larsen identifies the important break points in the YCS (years 
of commissioned service) scale and some MOS (military occupation specialty) 
categories that must be treated separately. The thesis by Dickinson, in addition 
to pursuing some isolated details that had been treated presumptuously in earlier 
work, introduces an empirical Bayes method that appears to be doing a better 
job of shrinking the raw estimates. It seems to manage better the unevenness of 
the cell inventories. Finally, some of our problems have also been experienced 
by Carter and Rolph [13] so we propose to pursue their suggestions as well. 

Our studies have led us to believe that the most important item in the 
continuation work is the aggregation problem. This problem has two aspects: 
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(i.) The grouping of cells into communities of homogeneous attrition 
behavior. 

(ii.) The combining or amalgamation of cells in order to meet min- 
imal cell inventory requirements. 

This need involves some exploration of the data. Because of the cumbersomeness 
of the data extraction problem, it will be necessary to make wise choices and 
study the most germane collections of cells. 

Based upon the results of Carter and Rolf, we anticipate that an adequate 
solution to the aggregation problem will lead to a clear policy for attrition rates 
generation. Once this is accomplished, we can turn to the specific needs and 
idiosyncrasies of the various application models. This will include questions of 
both short term and long term forecasting. 

1 INTRODUCTION 

In recent years, the Marine Corps has been phasing its manpower management 
into a centrally organized and computerized Officer Planning and Utility System 
(OPUS) [15, 16,17,18,19,20]. This system contains a number of planning models 
and such models are affected by three general factors: existing inventory (per- 
sonnel), projected losses, and projected gains. In order to project the inventory 
into various future time periods, it is necessary to use a realistic system of flow 
rates. Some of the rates are under administrative control, such as promotions, 
job assignments, and of course everyone acquires longevity with the passage of 
time. The attrition flowrates, however, can be anticipated only in a statistical 
sense. By attrition we mean leaving the service for any reason (e.g. resigna- 
tion, discharge, disability, release, retirement) and the circumstances that lead 
to these attritions are not under the control of the planner. (Note: Some attri- 
tions are voluntary and some involuntary. For general purposes we assume the 
planner is not cognizant of the involuntary losses.) 



3 



Our role in support of OPUS is to develop useful attrition rates so that 
losses of this type can be reasonably estimated. Obviously, the replacement lead 
time for planning is seldom small; most replacements ascend into the service 
as young lieutenants; augmentation from the reserves is also used. The cost of 
poor planning is great. Too many planned replacements lead to under utilized 
personnel; too few lead to jobs unfilled and the inability to function as required. 

The purpose of this report is to gather and summarize what we have learned 
about attrition rate generation as it pertains to the USMC officer corps; to 
describe the work in progress; to outline ways to study forecasting methods that 
can serve the individual needs of the various models. Thus sponsors and others 
are given current appraisal. This report also serves as a working document for 
students and other researchers. The terminology and notation are standardized. 

The report is organized as follows: Following this introduction we lay the 
base in terms of details of the problem description, notation, conventions, data 
structure and estimation methods. This section will also include a number 
of satellite issues including a discussion of the measures of effectiveness and 
the validation techniques. Section 3 contains summaries of the seven theses 
[1,22,34,38,50,57,58] that have been written in support of this project and dis- 
cusses how they integrate towards the common goal. Section 4 is devoted to 
a brief discussion of futuristics. It appears important that the researchers fa- 
miliarize themselves with the needs of specific user manpower models. Data 
structures and forecasting methods should be tailored for them. 

2 PROBLEM DESCRIPTION, ISSUES, DETAILS. 

A. General Structure and Notation 



For the macro view it is convenient to think of the officer “cells” as the 
result of cross classifying according to grade (GR), military occupation specialty 
(MOS) and length of service (LOS). It will be seen later that further refinement 
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is useful and sharpens the results. (This will be discussed under Data and 
Conventions.) Some of these cells are large (i.e. have large personnel inventory), 
e.g. the grades of first lieutenant or captain with 3-7 years of service and in the 
combat arms occupations. Those familiar with the Corps realize that there 
are many, many small cells. The GR factor has a pyramid structure; fewer 
officers in the higher grades. Of course, GR is well correlated with LOS, but 
not sufficently so that one of them can be removed from consideration. (Also 
LOS is closely correlated with YCS, years of commissioned service, and there 
are instances for which this distinction is important.) Under MOS we have 
considerable variability in that many officers are designated as qualified under 
several job codes. Some of the codes are robust in that there is a reasonable 
level of transferability; i.e. with a modest amount of training, an officer can 
transfer from one job to another. Other codes have high training costs or high 
levels of specialization; e.g. the aviation communities, and attorneys. Such 
considerations are very important to the manpower planner. They also impact 
upon the way that we build an attrition rate generator because the stabilization 
of rates for small cells will depend upon our ability to gather together small cells 
that have a communality of characteristics. 

The time flow of personnel through the system involves gaining a year on 
the LOS scale each year, periodic advancements (or not) in GR, and changes 
in MOS (responsibilities increase with experience). The USMC normally has 
between 18,000 and 20,000 officers. Although there are dependencies in the 
cell flows we are not prepared to include them in the modeling process of the 
attrition aspects of such a large system. Instead, a binomial distribution model 
is adopted. Further we presume cell to cell independence. The impact of the 
independence assumption will be softened by the way that we aggregate cells, 
and by the estimation technique. 

Although the cells are most numerous and their specifications are the result 
of cross classification, for purposes of study and development we assume that 
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